A Large Vocabulary Semantic Network for Computerised Speech Recognition
نویسندگان
چکیده
The work presented in this paper deals with the construction of a large-vocabulary semantic network to assist computerised speech or text recognition. The semantic network is systematically constructed with semantic information about nouns and verbs from the Longman Dictionary of Contemporary English by the application of pattern matching rules. It is represented in the form of a directed graph where nodes represent word senses and links represent the types of conceptual relationships. A semantic score, for pairwise combinations of word candidates in a speech recognition lattice, can be derived by traversing the network and calculating the conceptual distance between the senses of these candidates.
منابع مشابه
Spoken Term Detection for Persian News of Islamic Republic of Iran Broadcasting
Islamic Republic of Iran Broadcasting (IRIB) as one of the biggest broadcasting organizations, produces thousands of hours of media content daily. Accordingly, the IRIBchr('39')s archive is one of the richest archives in Iran containing a huge amount of multimedia data. Monitoring this massive volume of data, and brows and retrieval of this archive is one of the key issues for this broadcasting...
متن کاملContinuous word representation using neural networks for proper name retrieval from diachronic documents
Developing high-quality transcription systems for very large vocabulary corpora is a challenging task. Proper names are usually key to understanding the information contained in a document. One approach for increasing the vocabulary coverage of a speech transcription system is to automatically retrieve new proper names from contemporary diachronic text documents. In recent years, neural network...
متن کاملProsodically-enhanced recurrent neural network language models
Recurrent neural network language models have been shown to consistently reduce the word error rates (WERs) of large vocabulary speech recognition tasks. In this work we propose to enhance the RNNLMs with prosodic features computed using the context of the current word. Since it is plausible to compute the prosody features at the word and syllable level we have trained the models on prosody fea...
متن کاملLemmatized Latent Semantic Model for Language Model Adaptation of Highly Inflected Languages
We present a method to adapt statistical N-gram models for large vocabulary continuous speech recognition of highly inflected languages. The method combines morphological analysis, latent semantic analysis (LSA) and fast marginal adaptation for building topic-adapted trigram models, based on a background language model and very short adaptation texts. We compare words, lemmas and morphemes as b...
متن کاملUse of Knowledge Graph in Rescoring the N-Best List in Automatic Speech Recognition
With the evolution of neural network based methods, automatic speech recognition (ASR) field has been advanced to a level where building an application with speech interface is a reality. Inspite of these advances, building a real-time speech recogniser faces several problems such as low recognition accuracy, domain constraint and out-of-vocabulary words. The low recognition accuracy problem is...
متن کامل